Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 94881 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 8.0 MiB |
| Average record size in memory | 88.0 B |
Variable types
| Categorical | 1 |
|---|---|
| DateTime | 1 |
| Numeric | 9 |
id_estacion has a high cardinality: 207 distinct values | High cardinality |
tmax is highly correlated with tmin | High correlation |
tmin is highly correlated with tmax | High correlation |
longitud is highly correlated with latitud | High correlation |
latitud is highly correlated with longitud | High correlation |
tmax is highly correlated with tmin | High correlation |
tmin is highly correlated with tmax | High correlation |
tmax is highly correlated with tmin | High correlation |
tmin is highly correlated with tmax | High correlation |
longitud is highly correlated with latitud and 1 other fields | High correlation |
latitud is highly correlated with longitud and 1 other fields | High correlation |
tmin is highly correlated with altitud and 2 other fields | High correlation |
altitud is highly correlated with longitud and 3 other fields | High correlation |
fecha_cnt is highly correlated with tmin and 1 other fields | High correlation |
tmax is highly correlated with tmin and 2 other fields | High correlation |
nevada is highly skewed (γ1 = 133.5740724) | Skewed |
prof_nieve is highly skewed (γ1 = 62.17306945) | Skewed |
precip has 13169 (13.9%) zeros | Zeros |
nevada has 94869 (> 99.9%) zeros | Zeros |
prof_nieve has 93937 (99.0%) zeros | Zeros |
Reproduction
| Analysis started | 2021-10-09 13:00:21.422362 |
|---|---|
| Analysis finished | 2021-10-09 13:00:33.105205 |
| Duration | 11.68 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
| Distinct | 207 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 741.4 KiB |
| SP000009981 | 1391 |
|---|---|
| SP000008280 | 1329 |
| SP000003195 | 1218 |
| SPE00120629 | 1210 |
| SPE00155259 | 1206 |
| Other values (202) |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Characters and Unicode
| Total characters | 1043691 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SP000003195 |
|---|---|
| 2nd row | SP000003195 |
| 3rd row | SP000003195 |
| 4th row | SP000003195 |
| 5th row | SP000003195 |
Common Values
| Value | Count | Frequency (%) |
| SP000009981 | 1391 | 1.5% |
| SP000008280 | 1329 | 1.4% |
| SP000003195 | 1218 | 1.3% |
| SPE00120629 | 1210 | 1.3% |
| SPE00155259 | 1206 | 1.3% |
| SP000060010 | 1198 | 1.3% |
| SP000008027 | 1130 | 1.2% |
| SP000007038 | 1099 | 1.2% |
| SPE00119711 | 1090 | 1.1% |
| SPE00120458 | 1088 | 1.1% |
| Other values (197) | 82922 |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| sp000009981 | 1391 | 1.5% |
| sp000008280 | 1329 | 1.4% |
| sp000003195 | 1218 | 1.3% |
| spe00120629 | 1210 | 1.3% |
| spe00155259 | 1206 | 1.3% |
| sp000060010 | 1198 | 1.3% |
| sp000008027 | 1130 | 1.2% |
| sp000007038 | 1099 | 1.2% |
| spe00119711 | 1090 | 1.1% |
| spe00120458 | 1088 | 1.1% |
| Other values (197) | 82922 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 319330 | |
| 1 | 131987 | |
| S | 94881 | 9.1% |
| P | 94881 | 9.1% |
| 2 | 79522 | 7.6% |
| E | 76032 | 7.3% |
| 5 | 47507 | 4.6% |
| 9 | 47023 | 4.5% |
| 6 | 34715 | 3.3% |
| 8 | 33383 | 3.2% |
| Other values (5) | 84430 | 8.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 774846 | |
| Uppercase Letter | 268845 | 25.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 319330 | |
| 1 | 131987 | |
| 2 | 79522 | 10.3% |
| 5 | 47507 | 6.1% |
| 9 | 47023 | 6.1% |
| 6 | 34715 | 4.5% |
| 8 | 33383 | 4.3% |
| 3 | 31430 | 4.1% |
| 4 | 29670 | 3.8% |
| 7 | 20279 | 2.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 94881 | |
| P | 94881 | |
| E | 76032 | |
| W | 1903 | 0.7% |
| M | 1148 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 774846 | |
| Latin | 268845 | 25.8% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 319330 | |
| 1 | 131987 | |
| 2 | 79522 | 10.3% |
| 5 | 47507 | 6.1% |
| 9 | 47023 | 6.1% |
| 6 | 34715 | 4.5% |
| 8 | 33383 | 4.3% |
| 3 | 31430 | 4.1% |
| 4 | 29670 | 3.8% |
| 7 | 20279 | 2.6% |
Latin
| Value | Count | Frequency (%) |
| S | 94881 | |
| P | 94881 | |
| E | 76032 | |
| W | 1903 | 0.7% |
| M | 1148 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1043691 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 319330 | |
| 1 | 131987 | |
| S | 94881 | 9.1% |
| P | 94881 | 9.1% |
| 2 | 79522 | 7.6% |
| E | 76032 | 7.3% |
| 5 | 47507 | 4.6% |
| 9 | 47023 | 4.5% |
| 6 | 34715 | 3.3% |
| 8 | 33383 | 3.2% |
| Other values (5) | 84430 | 8.1% |
fecha
Date
| Distinct | 1465 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 741.4 KiB |
| Minimum | 1896-11-30 00:00:00 |
|---|---|
| Maximum | 2021-08-31 00:00:00 |
Histogram with fixed size bins (bins=50)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.496759098 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 741.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.447215885 |
|---|---|
| Coefficient of variation (CV) | 0.5306054654 |
| Kurtosis | -1.212344505 |
| Mean | 6.496759098 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.00209677699 |
| Sum | 616419 |
| Variance | 11.88329736 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=12)
| Value | Count | Frequency (%) |
| 3 | 7966 | |
| 5 | 7964 | |
| 4 | 7956 | |
| 6 | 7936 | |
| 7 | 7935 | |
| 1 | 7927 | |
| 8 | 7924 | |
| 9 | 7883 | |
| 10 | 7878 | |
| 12 | 7878 | |
| Other values (2) | 15634 |
| Value | Count | Frequency (%) |
| 1 | 7927 | |
| 2 | 7760 | |
| 3 | 7966 | |
| 4 | 7956 | |
| 5 | 7964 | |
| 6 | 7936 | |
| 7 | 7935 | |
| 8 | 7924 | |
| 9 | 7883 | |
| 10 | 7878 |
| Value | Count | Frequency (%) |
| 12 | 7878 | |
| 11 | 7874 | |
| 10 | 7878 | |
| 9 | 7883 | |
| 8 | 7924 | |
| 7 | 7935 | |
| 6 | 7936 | |
| 5 | 7964 | |
| 4 | 7956 | |
| 3 | 7966 |
| Distinct | 447 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 200.2001876 |
| Minimum | -53 |
|---|---|
| Maximum | 403 |
| Zeros | 14 |
| Zeros (%) | < 0.1% |
| Negative | 236 |
| Negative (%) | 0.2% |
| Memory size | 741.4 KiB |
Quantile statistics
| Minimum | -53 |
|---|---|
| 5-th percentile | 88 |
| Q1 | 148 |
| median | 198 |
| Q3 | 255 |
| 95-th percentile | 316 |
| Maximum | 403 |
| Range | 456 |
| Interquartile range (IQR) | 107 |
Descriptive statistics
| Standard deviation | 71.25637968 |
|---|---|
| Coefficient of variation (CV) | 0.3559256389 |
| Kurtosis | -0.5046758729 |
| Mean | 200.2001876 |
| Median Absolute Deviation (MAD) | 53 |
| Skewness | -0.02638990299 |
| Sum | 18995194 |
| Variance | 5077.471645 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 166 | 547 | 0.6% |
| 171 | 543 | 0.6% |
| 164 | 535 | 0.6% |
| 154 | 529 | 0.6% |
| 178 | 519 | 0.5% |
| 158 | 515 | 0.5% |
| 174 | 514 | 0.5% |
| 167 | 510 | 0.5% |
| 172 | 507 | 0.5% |
| 180 | 505 | 0.5% |
| Other values (437) | 89657 |
| Value | Count | Frequency (%) |
| -53 | 1 | < 0.1% |
| -50 | 1 | < 0.1% |
| -49 | 1 | < 0.1% |
| -47 | 1 | < 0.1% |
| -46 | 1 | < 0.1% |
| -45 | 1 | < 0.1% |
| -44 | 1 | < 0.1% |
| -43 | 1 | < 0.1% |
| -42 | 1 | < 0.1% |
| -41 | 3 |
| Value | Count | Frequency (%) |
| 403 | 1 | < 0.1% |
| 401 | 1 | < 0.1% |
| 400 | 1 | < 0.1% |
| 398 | 1 | < 0.1% |
| 396 | 1 | < 0.1% |
| 395 | 2 | |
| 391 | 1 | < 0.1% |
| 389 | 2 | |
| 388 | 4 | |
| 387 | 1 | < 0.1% |
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 98.85705252 |
| Minimum | -121 |
|---|---|
| Maximum | 254 |
| Zeros | 197 |
| Zeros (%) | 0.2% |
| Negative | 5003 |
| Negative (%) | 5.3% |
| Memory size | 741.4 KiB |
Quantile statistics
| Minimum | -121 |
|---|---|
| 5-th percentile | -2 |
| Q1 | 53 |
| median | 98 |
| Q3 | 148 |
| 95-th percentile | 199 |
| Maximum | 254 |
| Range | 375 |
| Interquartile range (IQR) | 95 |
Descriptive statistics
| Standard deviation | 62.26292305 |
|---|---|
| Coefficient of variation (CV) | 0.6298278319 |
| Kurtosis | -0.6619987427 |
| Mean | 98.85705252 |
| Median Absolute Deviation (MAD) | 48 |
| Skewness | -0.07068337551 |
| Sum | 9379656 |
| Variance | 3876.671587 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 86 | 569 | 0.6% |
| 99 | 563 | 0.6% |
| 81 | 561 | 0.6% |
| 74 | 561 | 0.6% |
| 80 | 558 | 0.6% |
| 66 | 555 | 0.6% |
| 82 | 554 | 0.6% |
| 84 | 553 | 0.6% |
| 77 | 551 | 0.6% |
| 75 | 550 | 0.6% |
| Other values (356) | 89306 |
| Value | Count | Frequency (%) |
| -121 | 1 | < 0.1% |
| -115 | 1 | < 0.1% |
| -114 | 1 | < 0.1% |
| -113 | 3 | |
| -112 | 3 | |
| -110 | 2 | |
| -109 | 2 | |
| -108 | 2 | |
| -106 | 2 | |
| -105 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 254 | 3 | |
| 252 | 1 | < 0.1% |
| 251 | 4 | |
| 250 | 4 | |
| 249 | 1 | < 0.1% |
| 248 | 2 | |
| 247 | 1 | < 0.1% |
| 246 | 3 | |
| 245 | 3 | |
| 244 | 4 |
| Distinct | 223 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.24936499 |
| Minimum | 0 |
|---|---|
| Maximum | 422 |
| Zeros | 13169 |
| Zeros (%) | 13.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 741.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3 |
| median | 10 |
| Q3 | 22 |
| 95-th percentile | 53 |
| Maximum | 422 |
| Range | 422 |
| Interquartile range (IQR) | 19 |
Descriptive statistics
| Standard deviation | 19.80259663 |
|---|---|
| Coefficient of variation (CV) | 1.218668953 |
| Kurtosis | 17.20503659 |
| Mean | 16.24936499 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 2.939634725 |
| Sum | 1541756 |
| Variance | 392.1428333 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 13169 | 13.9% |
| 1 | 5127 | 5.4% |
| 2 | 4310 | 4.5% |
| 3 | 3800 | 4.0% |
| 4 | 3650 | 3.8% |
| 5 | 3340 | 3.5% |
| 6 | 3214 | 3.4% |
| 7 | 3016 | 3.2% |
| 8 | 2919 | 3.1% |
| 9 | 2819 | 3.0% |
| Other values (213) | 49517 |
| Value | Count | Frequency (%) |
| 0 | 13169 | |
| 1 | 5127 | 5.4% |
| 2 | 4310 | 4.5% |
| 3 | 3800 | 4.0% |
| 4 | 3650 | 3.8% |
| 5 | 3340 | 3.5% |
| 6 | 3214 | 3.4% |
| 7 | 3016 | 3.2% |
| 8 | 2919 | 3.1% |
| 9 | 2819 | 3.0% |
| Value | Count | Frequency (%) |
| 422 | 1 | |
| 371 | 1 | |
| 320 | 1 | |
| 309 | 1 | |
| 305 | 1 | |
| 299 | 1 | |
| 280 | 1 | |
| 279 | 1 | |
| 259 | 1 | |
| 252 | 1 |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0002951065018 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 94869 |
| Zeros (%) | > 99.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 741.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.03045324718 |
|---|---|
| Coefficient of variation (CV) | 103.1940909 |
| Kurtosis | 21217.72797 |
| Mean | 0.0002951065018 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 133.5740724 |
| Sum | 28 |
| Variance | 0.0009274002637 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=6)
| Value | Count | Frequency (%) |
| 0 | 94869 | |
| 2 | 6 | < 0.1% |
| 1 | 3 | < 0.1% |
| 4 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 94869 | |
| 1 | 3 | < 0.1% |
| 2 | 6 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 2 | 6 | < 0.1% |
| 1 | 3 | < 0.1% |
| 0 | 94869 |
| Distinct | 170 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4669533416 |
| Minimum | 0 |
|---|---|
| Maximum | 1834 |
| Zeros | 93937 |
| Zeros (%) | 99.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 741.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 1834 |
| Range | 1834 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 14.85144419 |
|---|---|
| Coefficient of variation (CV) | 31.80498536 |
| Kurtosis | 5226.968602 |
| Mean | 0.4669533416 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 62.17306945 |
| Sum | 44305 |
| Variance | 220.5653945 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 93937 | |
| 1 | 302 | 0.3% |
| 2 | 123 | 0.1% |
| 3 | 67 | 0.1% |
| 4 | 43 | < 0.1% |
| 5 | 30 | < 0.1% |
| 6 | 18 | < 0.1% |
| 7 | 17 | < 0.1% |
| 11 | 10 | < 0.1% |
| 15 | 9 | < 0.1% |
| Other values (160) | 325 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 93937 | |
| 1 | 302 | 0.3% |
| 2 | 123 | 0.1% |
| 3 | 67 | 0.1% |
| 4 | 43 | < 0.1% |
| 5 | 30 | < 0.1% |
| 6 | 18 | < 0.1% |
| 7 | 17 | < 0.1% |
| 8 | 8 | < 0.1% |
| 9 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 1834 | 1 | |
| 1494 | 1 | |
| 1168 | 1 | |
| 1073 | 1 | |
| 1017 | 1 | |
| 892 | 1 | |
| 787 | 1 | |
| 784 | 1 | |
| 709 | 1 | |
| 666 | 1 |
| Distinct | 201 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39.66353835 |
| Minimum | 27.8189 |
|---|---|
| Maximum | 43.5667 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 741.4 KiB |
Quantile statistics
| Minimum | 27.8189 |
|---|---|
| 5-th percentile | 28.4775 |
| Q1 | 38.282 |
| median | 40.8206 |
| Q3 | 42.0831 |
| 95-th percentile | 43.3669 |
| Maximum | 43.5667 |
| Range | 15.7478 |
| Interquartile range (IQR) | 3.8011 |
Descriptive statistics
| Standard deviation | 3.767045748 |
|---|---|
| Coefficient of variation (CV) | 0.09497503009 |
| Kurtosis | 3.134861779 |
| Mean | 39.66353835 |
| Median Absolute Deviation (MAD) | 1.6197 |
| Skewness | -1.844457219 |
| Sum | 3763316.182 |
| Variance | 14.19063367 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 40.8206 | 1391 | 1.5% |
| 38.9519 | 1329 | 1.4% |
| 40.4117 | 1218 | 1.3% |
| 41.1144 | 1210 | 1.3% |
| 41.4181 | 1206 | 1.3% |
| 28.3089 | 1198 | 1.3% |
| 38.9892 | 1192 | 1.3% |
| 40.9478 | 1155 | 1.2% |
| 43.3075 | 1130 | 1.2% |
| 37.9769 | 1099 | 1.2% |
| Other values (191) | 82753 |
| Value | Count | Frequency (%) |
| 27.8189 | 573 | |
| 27.9225 | 633 | |
| 28.0475 | 493 | |
| 28.3089 | 1198 | |
| 28.4444 | 665 | |
| 28.4631 | 1088 | |
| 28.4775 | 951 | |
| 28.6331 | 654 | |
| 28.9517 | 640 | |
| 35.2778 | 715 |
| Value | Count | Frequency (%) |
| 43.5667 | 634 | |
| 43.5606 | 524 | |
| 43.5381 | 750 | |
| 43.4917 | 613 | |
| 43.4644 | 878 | |
| 43.4292 | 708 | |
| 43.3669 | 1090 | |
| 43.3606 | 764 | |
| 43.3542 | 585 | |
| 43.3075 | 1130 |
| Distinct | 206 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -3.4350247 |
| Minimum | -17.8889 |
|---|---|
| Maximum | 4.2156 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 66473 |
| Negative (%) | 70.1% |
| Memory size | 741.4 KiB |
Quantile statistics
| Minimum | -17.8889 |
|---|---|
| 5-th percentile | -16.2553 |
| Q1 | -5.6417 |
| median | -3.45 |
| Q3 | 0.4914 |
| 95-th percentile | 2.3767 |
| Maximum | 4.2156 |
| Range | 22.1045 |
| Interquartile range (IQR) | 6.1331 |
Descriptive statistics
| Standard deviation | 4.699469291 |
|---|---|
| Coefficient of variation (CV) | -1.368103493 |
| Kurtosis | 1.512397182 |
| Mean | -3.4350247 |
| Median Absolute Deviation (MAD) | 2.6056 |
| Skewness | -1.173117137 |
| Sum | -325918.5786 |
| Variance | 22.08501162 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| -3.7892 | 1506 | 1.6% |
| 0.4914 | 1391 | 1.5% |
| -1.8631 | 1329 | 1.4% |
| -3.6781 | 1218 | 1.3% |
| -1.4106 | 1210 | 1.3% |
| 2.1239 | 1206 | 1.3% |
| -16.4992 | 1198 | 1.3% |
| -2.0392 | 1130 | 1.2% |
| 0.7106 | 1099 | 1.2% |
| -8.4192 | 1090 | 1.1% |
| Other values (196) | 82504 |
| Value | Count | Frequency (%) |
| -17.8889 | 573 | |
| -17.755 | 654 | |
| -16.5606 | 493 | |
| -16.4992 | 1198 | |
| -16.3292 | 951 | |
| -16.2553 | 1088 | |
| -15.3892 | 633 | |
| -13.8631 | 665 | |
| -13.6003 | 640 | |
| -8.6494 | 320 | 0.3% |
| Value | Count | Frequency (%) |
| 4.2156 | 652 | |
| 3.1817 | 156 | 0.2% |
| 3.1658 | 156 | 0.2% |
| 3.0967 | 156 | 0.2% |
| 3.0353 | 156 | 0.2% |
| 3.0325 | 156 | 0.2% |
| 2.8342 | 60 | 0.1% |
| 2.8267 | 132 | 0.1% |
| 2.8253 | 695 | |
| 2.8067 | 124 | 0.1% |
| Distinct | 173 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 418.5396844 |
| Minimum | 1 |
|---|---|
| Maximum | 2535 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 741.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 42 |
| median | 247 |
| Q3 | 656 |
| 95-th percentile | 1143 |
| Maximum | 2535 |
| Range | 2534 |
| Interquartile range (IQR) | 614 |
Descriptive statistics
| Standard deviation | 504.207495 |
|---|---|
| Coefficient of variation (CV) | 1.204682647 |
| Kurtosis | 4.649018778 |
| Mean | 418.5396844 |
| Median Absolute Deviation (MAD) | 230 |
| Skewness | 1.9670506 |
| Sum | 39711463.8 |
| Variance | 254225.1981 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 4 | 2794 | 2.9% |
| 1 | 2016 | 2.1% |
| 35 | 1672 | 1.8% |
| 32 | 1616 | 1.7% |
| 44 | 1391 | 1.5% |
| 5 | 1375 | 1.4% |
| 64 | 1371 | 1.4% |
| 87 | 1345 | 1.4% |
| 704 | 1329 | 1.4% |
| 25 | 1305 | 1.4% |
| Other values (163) | 78667 |
| Value | Count | Frequency (%) |
| 1 | 2016 | |
| 2 | 312 | 0.3% |
| 3 | 742 | 0.8% |
| 4 | 2794 | |
| 5 | 1375 | |
| 6 | 644 | 0.7% |
| 7 | 1300 | |
| 8 | 118 | 0.1% |
| 11 | 1033 | 1.1% |
| 14 | 796 | 0.8% |
| Value | Count | Frequency (%) |
| 2535 | 156 | 0.2% |
| 2519 | 155 | 0.2% |
| 2451 | 156 | 0.2% |
| 2400 | 155 | 0.2% |
| 2371 | 1198 | |
| 2316 | 156 | 0.2% |
| 2266 | 156 | 0.2% |
| 2247 | 156 | 0.2% |
| 2230 | 156 | 0.2% |
| 2228 | 156 | 0.2% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's Ļ
The Spearman's rank correlation coefficient (Ļ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate Ļ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's Ļ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (Ļ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate Ļ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. Ļ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (Ļk)
Phik (Ļk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| id_estacion | fecha | fecha_cnt | tmax | tmin | precip | nevada | prof_nieve | longitud | latitud | altitud | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | SP000003195 | 1920-01-31 | 1 | 103.0 | 22.0 | 1.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 1 | SP000003195 | 1920-02-29 | 2 | 119.0 | 37.0 | 31.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 2 | SP000003195 | 1920-03-31 | 3 | 149.0 | 50.0 | 10.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 3 | SP000003195 | 1920-04-30 | 4 | 189.0 | 76.0 | 9.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 4 | SP000003195 | 1920-05-31 | 5 | 250.0 | 120.0 | 24.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 5 | SP000003195 | 1920-06-30 | 6 | 274.0 | 149.0 | 5.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 6 | SP000003195 | 1920-07-31 | 7 | 311.0 | 165.0 | 2.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 7 | SP000003195 | 1920-08-31 | 8 | 311.0 | 168.0 | 0.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 8 | SP000003195 | 1920-09-30 | 9 | 254.0 | 134.0 | 1.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 9 | SP000003195 | 1920-10-31 | 10 | 160.0 | 84.0 | 28.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
Last rows
| id_estacion | fecha | fecha_cnt | tmax | tmin | precip | nevada | prof_nieve | longitud | latitud | altitud | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 94871 | SPW00014011 | 1967-03-31 | 3 | 172.0 | 29.0 | 5.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 94872 | SPW00014011 | 1967-04-30 | 4 | 164.0 | 38.0 | 15.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 94873 | SPW00014011 | 1967-05-31 | 5 | 194.0 | 61.0 | 12.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 94874 | SPW00014011 | 1967-06-30 | 6 | 252.0 | 101.0 | 4.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 94875 | SPW00014011 | 1967-07-31 | 7 | 345.0 | 154.0 | 0.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 94876 | SPW00014011 | 1967-08-31 | 8 | 322.0 | 141.0 | 0.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 94877 | SPW00014011 | 1967-09-30 | 9 | 257.0 | 102.0 | 5.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 94878 | SPW00014011 | 1967-10-31 | 10 | 213.0 | 90.0 | 17.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 94879 | SPW00014011 | 1967-11-30 | 11 | 131.0 | 44.0 | 29.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 94880 | SPW00014011 | 1967-12-31 | 12 | 82.0 | -23.0 | 0.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |